Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 79215 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.9 MiB |
| Average record size in memory | 104.0 B |
Variable types
| Numeric | 13 |
|---|
df_index is highly correlated with X_22 | High correlation |
X_19 is highly correlated with X_20 and 2 other fields | High correlation |
X_20 is highly correlated with X_19 and 2 other fields | High correlation |
X_21 is highly correlated with X_19 and 2 other fields | High correlation |
X_22 is highly correlated with df_index and 3 other fields | High correlation |
X_31 is highly correlated with X_33 | High correlation |
X_33 is highly correlated with X_31 | High correlation |
df_index is highly correlated with X_22 | High correlation |
X_19 is highly correlated with X_20 and 2 other fields | High correlation |
X_20 is highly correlated with X_19 and 2 other fields | High correlation |
X_21 is highly correlated with X_19 and 2 other fields | High correlation |
X_22 is highly correlated with df_index and 3 other fields | High correlation |
X_30 is highly correlated with X_32 | High correlation |
X_32 is highly correlated with X_30 | High correlation |
X_19 is highly correlated with X_21 | High correlation |
X_20 is highly correlated with X_21 and 1 other fields | High correlation |
X_21 is highly correlated with X_19 and 1 other fields | High correlation |
X_22 is highly correlated with X_20 | High correlation |
df_index is highly correlated with X_19 and 3 other fields | High correlation |
X_19 is highly correlated with df_index and 3 other fields | High correlation |
X_20 is highly correlated with df_index and 3 other fields | High correlation |
X_21 is highly correlated with df_index and 3 other fields | High correlation |
X_22 is highly correlated with df_index and 3 other fields | High correlation |
df_index is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2022-08-07 05:50:57.643680 |
|---|---|
| Analysis finished | 2022-08-07 05:51:24.768451 |
| Duration | 27.12 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 39608 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19803.25 |
| Minimum | 0 |
|---|---|
| Maximum | 39607 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1980 |
| Q1 | 9901.5 |
| median | 19803 |
| Q3 | 29705 |
| 95-th percentile | 37626.3 |
| Maximum | 39607 |
| Range | 39607 |
| Interquartile range (IQR) | 19803.5 |
Descriptive statistics
| Standard deviation | 11433.77256 |
|---|---|
| Coefficient of variation (CV) | 0.5773684907 |
| Kurtosis | -1.199999999 |
| Mean | 19803.25 |
| Median Absolute Deviation (MAD) | 9902 |
| Skewness | 1.6561712 × 10-9 |
| Sum | 1568714449 |
| Variance | 130731155.1 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 26407 | 2 | < 0.1% |
| 26400 | 2 | < 0.1% |
| 26401 | 2 | < 0.1% |
| 26402 | 2 | < 0.1% |
| 26403 | 2 | < 0.1% |
| 26404 | 2 | < 0.1% |
| 26405 | 2 | < 0.1% |
| 26406 | 2 | < 0.1% |
| 26408 | 2 | < 0.1% |
| Other values (39598) | 79195 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 2 | |
| 2 | 2 | |
| 3 | 2 | |
| 4 | 2 | |
| 5 | 2 | |
| 6 | 2 | |
| 7 | 2 | |
| 8 | 2 | |
| 9 | 2 |
| Value | Count | Frequency (%) |
| 39607 | 1 | |
| 39606 | 2 | |
| 39605 | 2 | |
| 39604 | 2 | |
| 39603 | 2 | |
| 39602 | 2 | |
| 39601 | 2 | |
| 39600 | 2 | |
| 39599 | 2 | |
| 39598 | 2 |
| Distinct | 84 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.239343432 |
| Minimum | 2.86 |
|---|---|
| Maximum | 3.75 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 2.86 |
|---|---|
| 5-th percentile | 3.09 |
| Q1 | 3.16 |
| median | 3.22 |
| Q3 | 3.31 |
| 95-th percentile | 3.45 |
| Maximum | 3.75 |
| Range | 0.89 |
| Interquartile range (IQR) | 0.15 |
Descriptive statistics
| Standard deviation | 0.1102015904 |
|---|---|
| Coefficient of variation (CV) | 0.03401973046 |
| Kurtosis | -0.2850142993 |
| Mean | 3.239343432 |
| Median Absolute Deviation (MAD) | 0.07 |
| Skewness | 0.4890459561 |
| Sum | 256604.59 |
| Variance | 0.01214439054 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 3.19 | 3195 | 4.0% |
| 3.18 | 3135 | 4.0% |
| 3.16 | 3068 | 3.9% |
| 3.17 | 3059 | 3.9% |
| 3.2 | 2904 | 3.7% |
| 3.22 | 2878 | 3.6% |
| 3.21 | 2844 | 3.6% |
| 3.13 | 2769 | 3.5% |
| 3.15 | 2762 | 3.5% |
| 3.14 | 2759 | 3.5% |
| Other values (74) | 49842 |
| Value | Count | Frequency (%) |
| 2.86 | 1 | < 0.1% |
| 2.89 | 3 | < 0.1% |
| 2.9 | 4 | < 0.1% |
| 2.91 | 5 | < 0.1% |
| 2.92 | 4 | < 0.1% |
| 2.93 | 9 | < 0.1% |
| 2.94 | 10 | < 0.1% |
| 2.95 | 11 | < 0.1% |
| 2.96 | 19 | |
| 2.97 | 42 |
| Value | Count | Frequency (%) |
| 3.75 | 1 | < 0.1% |
| 3.74 | 1 | < 0.1% |
| 3.72 | 1 | < 0.1% |
| 3.71 | 1 | < 0.1% |
| 3.69 | 1 | < 0.1% |
| 3.66 | 2 | |
| 3.65 | 1 | < 0.1% |
| 3.64 | 2 | |
| 3.63 | 4 | |
| 3.62 | 2 |
| Distinct | 79 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.184119422 |
| Minimum | 2.83 |
|---|---|
| Maximum | 3.67 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 2.83 |
|---|---|
| 5-th percentile | 3.03 |
| Q1 | 3.1 |
| median | 3.18 |
| Q3 | 3.27 |
| 95-th percentile | 3.35 |
| Maximum | 3.67 |
| Range | 0.84 |
| Interquartile range (IQR) | 0.17 |
Descriptive statistics
| Standard deviation | 0.1052249117 |
|---|---|
| Coefficient of variation (CV) | 0.03304678556 |
| Kurtosis | -0.7258073828 |
| Mean | 3.184119422 |
| Median Absolute Deviation (MAD) | 0.08 |
| Skewness | 0.07319835121 |
| Sum | 252230.02 |
| Variance | 0.01107228205 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 3.11 | 2888 | 3.6% |
| 3.09 | 2738 | 3.5% |
| 3.12 | 2699 | 3.4% |
| 3.1 | 2643 | 3.3% |
| 3.26 | 2559 | 3.2% |
| 3.29 | 2513 | 3.2% |
| 3.08 | 2449 | 3.1% |
| 3.13 | 2417 | 3.1% |
| 3.28 | 2369 | 3.0% |
| 3.25 | 2361 | 3.0% |
| Other values (69) | 53579 |
| Value | Count | Frequency (%) |
| 2.83 | 8 | < 0.1% |
| 2.84 | 9 | < 0.1% |
| 2.85 | 16 | |
| 2.86 | 8 | < 0.1% |
| 2.87 | 18 | |
| 2.88 | 15 | < 0.1% |
| 2.89 | 29 | |
| 2.9 | 27 | |
| 2.91 | 38 | |
| 2.92 | 34 |
| Value | Count | Frequency (%) |
| 3.67 | 1 | < 0.1% |
| 3.62 | 2 | |
| 3.61 | 2 | |
| 3.59 | 2 | |
| 3.58 | 3 | |
| 3.57 | 2 | |
| 3.56 | 1 | < 0.1% |
| 3.55 | 2 | |
| 3.54 | 2 | |
| 3.53 | 4 |
| Distinct | 77 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.173294326 |
| Minimum | 2.83 |
|---|---|
| Maximum | 3.68 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 2.83 |
|---|---|
| 5-th percentile | 3.03 |
| Q1 | 3.09 |
| median | 3.16 |
| Q3 | 3.24 |
| 95-th percentile | 3.37 |
| Maximum | 3.68 |
| Range | 0.85 |
| Interquartile range (IQR) | 0.15 |
Descriptive statistics
| Standard deviation | 0.1066429125 |
|---|---|
| Coefficient of variation (CV) | 0.03360637293 |
| Kurtosis | -0.2776467029 |
| Mean | 3.173294326 |
| Median Absolute Deviation (MAD) | 0.07 |
| Skewness | 0.5241003791 |
| Sum | 251372.51 |
| Variance | 0.01137271079 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 3.1 | 3514 | 4.4% |
| 3.11 | 3513 | 4.4% |
| 3.09 | 3479 | 4.4% |
| 3.12 | 3400 | 4.3% |
| 3.08 | 3180 | 4.0% |
| 3.13 | 2925 | 3.7% |
| 3.07 | 2857 | 3.6% |
| 3.14 | 2723 | 3.4% |
| 3.15 | 2666 | 3.4% |
| 3.17 | 2659 | 3.4% |
| Other values (67) | 48299 |
| Value | Count | Frequency (%) |
| 2.83 | 4 | < 0.1% |
| 2.84 | 2 | < 0.1% |
| 2.86 | 4 | < 0.1% |
| 2.87 | 7 | < 0.1% |
| 2.88 | 15 | < 0.1% |
| 2.89 | 19 | |
| 2.9 | 19 | |
| 2.91 | 30 | |
| 2.92 | 38 | |
| 2.93 | 41 |
| Value | Count | Frequency (%) |
| 3.68 | 1 | < 0.1% |
| 3.61 | 3 | < 0.1% |
| 3.58 | 5 | < 0.1% |
| 3.57 | 3 | < 0.1% |
| 3.56 | 8 | < 0.1% |
| 3.55 | 5 | < 0.1% |
| 3.54 | 11 | |
| 3.53 | 9 | |
| 3.52 | 12 | |
| 3.51 | 21 |
| Distinct | 90 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.232379979 |
| Minimum | 2.85 |
|---|---|
| Maximum | 3.82 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 2.85 |
|---|---|
| 5-th percentile | 3.07 |
| Q1 | 3.14 |
| median | 3.23 |
| Q3 | 3.32 |
| 95-th percentile | 3.4 |
| Maximum | 3.82 |
| Range | 0.97 |
| Interquartile range (IQR) | 0.18 |
Descriptive statistics
| Standard deviation | 0.1086628494 |
|---|---|
| Coefficient of variation (CV) | 0.03361697887 |
| Kurtosis | -0.6259001337 |
| Mean | 3.232379979 |
| Median Absolute Deviation (MAD) | 0.09 |
| Skewness | 0.05365041662 |
| Sum | 256052.98 |
| Variance | 0.01180761485 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 3.13 | 2594 | 3.3% |
| 3.3 | 2496 | 3.2% |
| 3.19 | 2481 | 3.1% |
| 3.18 | 2444 | 3.1% |
| 3.31 | 2423 | 3.1% |
| 3.12 | 2409 | 3.0% |
| 3.14 | 2397 | 3.0% |
| 3.21 | 2394 | 3.0% |
| 3.16 | 2374 | 3.0% |
| 3.22 | 2330 | 2.9% |
| Other values (80) | 54873 |
| Value | Count | Frequency (%) |
| 2.85 | 5 | < 0.1% |
| 2.86 | 4 | < 0.1% |
| 2.87 | 8 | < 0.1% |
| 2.88 | 9 | < 0.1% |
| 2.89 | 12 | |
| 2.9 | 14 | |
| 2.91 | 16 | |
| 2.92 | 21 | |
| 2.93 | 23 | |
| 2.94 | 28 |
| Value | Count | Frequency (%) |
| 3.82 | 1 | |
| 3.8 | 1 | |
| 3.79 | 1 | |
| 3.78 | 1 | |
| 3.77 | 1 | |
| 3.75 | 1 | |
| 3.73 | 1 | |
| 3.71 | 1 | |
| 3.69 | 2 | |
| 3.66 | 2 |
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.37883772 |
| Minimum | 0.57 |
|---|---|
| Maximum | 2.11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 0.57 |
|---|---|
| 5-th percentile | 1.35 |
| Q1 | 1.37 |
| median | 1.37 |
| Q3 | 1.38 |
| 95-th percentile | 1.41 |
| Maximum | 2.11 |
| Range | 1.54 |
| Interquartile range (IQR) | 0.01 |
Descriptive statistics
| Standard deviation | 0.03008779557 |
|---|---|
| Coefficient of variation (CV) | 0.02182112886 |
| Kurtosis | 179.89617 |
| Mean | 1.37883772 |
| Median Absolute Deviation (MAD) | 0.01 |
| Skewness | -3.208254837 |
| Sum | 109224.63 |
| Variance | 0.0009052754424 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=45)
| Value | Count | Frequency (%) |
| 1.37 | 23423 | |
| 1.38 | 21775 | |
| 1.36 | 11932 | |
| 1.39 | 10318 | |
| 1.35 | 3963 | 5.0% |
| 1.4 | 2738 | 3.5% |
| 1.48 | 917 | 1.2% |
| 1.49 | 782 | 1.0% |
| 1.47 | 757 | 1.0% |
| 1.34 | 621 | 0.8% |
| Other values (35) | 1989 | 2.5% |
| Value | Count | Frequency (%) |
| 0.57 | 24 | < 0.1% |
| 1.21 | 1 | < 0.1% |
| 1.27 | 2 | < 0.1% |
| 1.28 | 5 | < 0.1% |
| 1.29 | 2 | < 0.1% |
| 1.3 | 7 | < 0.1% |
| 1.31 | 16 | < 0.1% |
| 1.32 | 26 | < 0.1% |
| 1.33 | 66 | 0.1% |
| 1.34 | 621 |
| Value | Count | Frequency (%) |
| 2.11 | 1 | |
| 2.09 | 1 | |
| 2.03 | 1 | |
| 2 | 1 | |
| 1.99 | 1 | |
| 1.87 | 1 | |
| 1.78 | 1 | |
| 1.75 | 1 | |
| 1.68 | 1 | |
| 1.63 | 1 |
| Distinct | 90 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.571119864 |
| Minimum | 0.6 |
|---|---|
| Maximum | 7.89 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 0.6 |
|---|---|
| 5-th percentile | 1.51 |
| Q1 | 1.53 |
| median | 1.55 |
| Q3 | 1.6 |
| 95-th percentile | 1.7 |
| Maximum | 7.89 |
| Range | 7.29 |
| Interquartile range (IQR) | 0.07 |
Descriptive statistics
| Standard deviation | 0.07509934482 |
|---|---|
| Coefficient of variation (CV) | 0.04779988246 |
| Kurtosis | 1061.112532 |
| Mean | 1.571119864 |
| Median Absolute Deviation (MAD) | 0.02 |
| Skewness | 14.14912024 |
| Sum | 124456.26 |
| Variance | 0.005639911593 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.54 | 12208 | |
| 1.53 | 11765 | |
| 1.52 | 9055 | |
| 1.55 | 8228 | |
| 1.56 | 5340 | 6.7% |
| 1.51 | 4235 | 5.3% |
| 1.57 | 3253 | 4.1% |
| 1.58 | 2135 | 2.7% |
| 1.61 | 2074 | 2.6% |
| 1.62 | 2064 | 2.6% |
| Other values (80) | 18858 |
| Value | Count | Frequency (%) |
| 0.6 | 29 | < 0.1% |
| 1.43 | 2 | < 0.1% |
| 1.44 | 1 | < 0.1% |
| 1.45 | 2 | < 0.1% |
| 1.46 | 5 | < 0.1% |
| 1.47 | 5 | < 0.1% |
| 1.48 | 16 | < 0.1% |
| 1.49 | 92 | 0.1% |
| 1.5 | 875 | 1.1% |
| 1.51 | 4235 |
| Value | Count | Frequency (%) |
| 7.89 | 1 | |
| 7.21 | 1 | |
| 3.48 | 1 | |
| 3.11 | 1 | |
| 2.96 | 1 | |
| 2.8 | 1 | |
| 2.79 | 1 | |
| 2.76 | 2 | |
| 2.7 | 1 | |
| 2.67 | 1 |
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.362823076 |
| Minimum | 0.57 |
|---|---|
| Maximum | 2.45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 0.57 |
|---|---|
| 5-th percentile | 1.34 |
| Q1 | 1.35 |
| median | 1.36 |
| Q3 | 1.37 |
| 95-th percentile | 1.38 |
| Maximum | 2.45 |
| Range | 1.88 |
| Interquartile range (IQR) | 0.02 |
Descriptive statistics
| Standard deviation | 0.02936587865 |
|---|---|
| Coefficient of variation (CV) | 0.02154782903 |
| Kurtosis | 266.5905596 |
| Mean | 1.362823076 |
| Median Absolute Deviation (MAD) | 0.01 |
| Skewness | -4.140794936 |
| Sum | 107956.03 |
| Variance | 0.0008623548289 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=45)
| Value | Count | Frequency (%) |
| 1.36 | 27380 | |
| 1.35 | 21542 | |
| 1.37 | 16383 | |
| 1.34 | 5743 | 7.2% |
| 1.38 | 3382 | 4.3% |
| 1.46 | 1316 | 1.7% |
| 1.47 | 901 | 1.1% |
| 1.45 | 766 | 1.0% |
| 1.33 | 674 | 0.9% |
| 1.39 | 354 | 0.4% |
| Other values (35) | 774 | 1.0% |
| Value | Count | Frequency (%) |
| 0.57 | 31 | < 0.1% |
| 1.26 | 1 | < 0.1% |
| 1.27 | 3 | < 0.1% |
| 1.28 | 5 | < 0.1% |
| 1.29 | 9 | < 0.1% |
| 1.3 | 12 | < 0.1% |
| 1.31 | 13 | < 0.1% |
| 1.32 | 80 | 0.1% |
| 1.33 | 674 | 0.9% |
| 1.34 | 5743 |
| Value | Count | Frequency (%) |
| 2.45 | 1 | |
| 2.29 | 1 | |
| 2.14 | 1 | |
| 2.11 | 1 | |
| 1.96 | 1 | |
| 1.92 | 1 | |
| 1.9 | 1 | |
| 1.84 | 2 | |
| 1.77 | 1 | |
| 1.7 | 1 |
| Distinct | 120 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.596088115 |
| Minimum | 0.61 |
|---|---|
| Maximum | 8.95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 0.61 |
|---|---|
| 5-th percentile | 1.53 |
| Q1 | 1.55 |
| median | 1.57 |
| Q3 | 1.61 |
| 95-th percentile | 1.77 |
| Maximum | 8.95 |
| Range | 8.34 |
| Interquartile range (IQR) | 0.06 |
Descriptive statistics
| Standard deviation | 0.1192228405 |
|---|---|
| Coefficient of variation (CV) | 0.07469690391 |
| Kurtosis | 777.5093417 |
| Mean | 1.596088115 |
| Median Absolute Deviation (MAD) | 0.02 |
| Skewness | 15.57564374 |
| Sum | 126434.12 |
| Variance | 0.0142140857 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.55 | 11704 | |
| 1.56 | 10880 | |
| 1.57 | 8933 | |
| 1.54 | 7754 | |
| 1.58 | 6141 | 7.8% |
| 1.53 | 4456 | 5.6% |
| 1.59 | 3984 | 5.0% |
| 1.6 | 2678 | 3.4% |
| 1.52 | 2002 | 2.5% |
| 1.61 | 1809 | 2.3% |
| Other values (110) | 18874 |
| Value | Count | Frequency (%) |
| 0.61 | 156 | 0.2% |
| 1.43 | 1 | < 0.1% |
| 1.48 | 5 | < 0.1% |
| 1.49 | 15 | < 0.1% |
| 1.5 | 97 | 0.1% |
| 1.51 | 602 | 0.8% |
| 1.52 | 2002 | 2.5% |
| 1.53 | 4456 | 5.6% |
| 1.54 | 7754 | |
| 1.55 | 11704 |
| Value | Count | Frequency (%) |
| 8.95 | 1 | |
| 8.07 | 1 | |
| 7.86 | 1 | |
| 7.81 | 1 | |
| 7.6 | 1 | |
| 7.53 | 1 | |
| 6.54 | 1 | |
| 5.97 | 1 | |
| 5.76 | 1 | |
| 5.6 | 1 |
X_34
Real number (ℝ≥0)
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.95020678 |
| Minimum | 12.84 |
|---|---|
| Maximum | 13.23 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 12.84 |
|---|---|
| 5-th percentile | 12.87 |
| Q1 | 12.92 |
| median | 12.96 |
| Q3 | 12.99 |
| 95-th percentile | 13.01 |
| Maximum | 13.23 |
| Range | 0.39 |
| Interquartile range (IQR) | 0.07 |
Descriptive statistics
| Standard deviation | 0.04412292821 |
|---|---|
| Coefficient of variation (CV) | 0.003407121521 |
| Kurtosis | -0.6989091487 |
| Mean | 12.95020678 |
| Median Absolute Deviation (MAD) | 0.03 |
| Skewness | -0.4221656177 |
| Sum | 1025850.63 |
| Variance | 0.001946832794 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=27)
| Value | Count | Frequency (%) |
| 12.97 | 10031 | |
| 12.99 | 9820 | |
| 12.96 | 7879 | |
| 12.94 | 6078 | 7.7% |
| 12.98 | 5410 | 6.8% |
| 12.92 | 4756 | 6.0% |
| 12.87 | 4377 | 5.5% |
| 13.01 | 4207 | 5.3% |
| 12.89 | 3960 | 5.0% |
| 13 | 3869 | 4.9% |
| Other values (17) | 18828 |
| Value | Count | Frequency (%) |
| 12.84 | 45 | 0.1% |
| 12.85 | 221 | 0.3% |
| 12.86 | 2038 | |
| 12.87 | 4377 | |
| 12.88 | 2283 | |
| 12.89 | 3960 | |
| 12.9 | 1759 | 2.2% |
| 12.91 | 3745 | |
| 12.92 | 4756 | |
| 12.93 | 2609 |
| Value | Count | Frequency (%) |
| 13.23 | 1 | < 0.1% |
| 13.09 | 1 | < 0.1% |
| 13.08 | 9 | < 0.1% |
| 13.07 | 7 | < 0.1% |
| 13.06 | 34 | < 0.1% |
| 13.05 | 55 | 0.1% |
| 13.04 | 339 | 0.4% |
| 13.03 | 945 | 1.2% |
| 13.02 | 995 | 1.3% |
| 13.01 | 4207 |
X_35
Real number (ℝ≥0)
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.9204027 |
| Minimum | 12.81 |
|---|---|
| Maximum | 13.09 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 12.81 |
|---|---|
| 5-th percentile | 12.85 |
| Q1 | 12.87 |
| median | 12.92 |
| Q3 | 12.97 |
| 95-th percentile | 13 |
| Maximum | 13.09 |
| Range | 0.28 |
| Interquartile range (IQR) | 0.1 |
Descriptive statistics
| Standard deviation | 0.0521395629 |
|---|---|
| Coefficient of variation (CV) | 0.004035444104 |
| Kurtosis | -1.232695046 |
| Mean | 12.9204027 |
| Median Absolute Deviation (MAD) | 0.05 |
| Skewness | 0.1062605863 |
| Sum | 1023489.7 |
| Variance | 0.00271853402 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=28)
| Value | Count | Frequency (%) |
| 12.86 | 8153 | 10.3% |
| 12.88 | 7673 | 9.7% |
| 12.98 | 7429 | 9.4% |
| 12.96 | 5802 | 7.3% |
| 12.85 | 4833 | 6.1% |
| 12.9 | 4664 | 5.9% |
| 12.95 | 4595 | 5.8% |
| 12.87 | 4407 | 5.6% |
| 12.93 | 4365 | 5.5% |
| 12.91 | 4226 | 5.3% |
| Other values (18) | 23068 |
| Value | Count | Frequency (%) |
| 12.81 | 63 | 0.1% |
| 12.82 | 231 | 0.3% |
| 12.83 | 1274 | 1.6% |
| 12.84 | 1184 | 1.5% |
| 12.85 | 4833 | |
| 12.86 | 8153 | |
| 12.87 | 4407 | |
| 12.88 | 7673 | |
| 12.89 | 2815 | 3.6% |
| 12.9 | 4664 |
| Value | Count | Frequency (%) |
| 13.09 | 1 | < 0.1% |
| 13.07 | 7 | < 0.1% |
| 13.06 | 9 | < 0.1% |
| 13.05 | 57 | 0.1% |
| 13.04 | 62 | 0.1% |
| 13.03 | 280 | 0.4% |
| 13.02 | 877 | 1.1% |
| 13.01 | 971 | 1.2% |
| 13 | 3675 | |
| 12.99 | 3430 |
X_36
Real number (ℝ≥0)
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.94169147 |
| Minimum | 12.84 |
|---|---|
| Maximum | 13.09 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 12.84 |
|---|---|
| 5-th percentile | 12.87 |
| Q1 | 12.9 |
| median | 12.95 |
| Q3 | 12.98 |
| 95-th percentile | 13.01 |
| Maximum | 13.09 |
| Range | 0.25 |
| Interquartile range (IQR) | 0.08 |
Descriptive statistics
| Standard deviation | 0.04801667182 |
|---|---|
| Coefficient of variation (CV) | 0.003710231535 |
| Kurtosis | -1.116694626 |
| Mean | 12.94169147 |
| Median Absolute Deviation (MAD) | 0.04 |
| Skewness | -0.1963443169 |
| Sum | 1025176.09 |
| Variance | 0.002305600772 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=26)
| Value | Count | Frequency (%) |
| 12.99 | 8549 | |
| 12.97 | 8530 | |
| 12.96 | 6895 | 8.7% |
| 12.87 | 6655 | 8.4% |
| 12.89 | 5784 | 7.3% |
| 12.94 | 5333 | 6.7% |
| 12.98 | 4643 | 5.9% |
| 12.92 | 4381 | 5.5% |
| 12.91 | 4031 | 5.1% |
| 13.01 | 3838 | 4.8% |
| Other values (16) | 20576 |
| Value | Count | Frequency (%) |
| 12.84 | 162 | 0.2% |
| 12.85 | 408 | 0.5% |
| 12.86 | 3245 | |
| 12.87 | 6655 | |
| 12.88 | 3402 | |
| 12.89 | 5784 | |
| 12.9 | 2063 | 2.6% |
| 12.91 | 4031 | |
| 12.92 | 4381 | |
| 12.93 | 2350 | 3.0% |
| Value | Count | Frequency (%) |
| 13.09 | 3 | < 0.1% |
| 13.08 | 5 | < 0.1% |
| 13.07 | 6 | < 0.1% |
| 13.06 | 37 | < 0.1% |
| 13.05 | 68 | 0.1% |
| 13.04 | 288 | 0.4% |
| 13.03 | 891 | 1.1% |
| 13.02 | 885 | 1.1% |
| 13.01 | 3838 | |
| 13 | 3667 |
X_37
Real number (ℝ≥0)
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.91931642 |
| Minimum | 12.81 |
|---|---|
| Maximum | 13.08 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.0 KiB |
Quantile statistics
| Minimum | 12.81 |
|---|---|
| 5-th percentile | 12.85 |
| Q1 | 12.87 |
| median | 12.91 |
| Q3 | 12.97 |
| 95-th percentile | 13 |
| Maximum | 13.08 |
| Range | 0.27 |
| Interquartile range (IQR) | 0.1 |
Descriptive statistics
| Standard deviation | 0.05231542438 |
|---|---|
| Coefficient of variation (CV) | 0.004049395703 |
| Kurtosis | -1.231453122 |
| Mean | 12.91931642 |
| Median Absolute Deviation (MAD) | 0.05 |
| Skewness | 0.1474668409 |
| Sum | 1023403.65 |
| Variance | 0.002736903628 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=28)
| Value | Count | Frequency (%) |
| 12.86 | 8388 | 10.6% |
| 12.88 | 8279 | 10.5% |
| 12.98 | 7167 | 9.0% |
| 12.96 | 5644 | 7.1% |
| 12.85 | 4923 | 6.2% |
| 12.9 | 4640 | 5.9% |
| 12.87 | 4449 | 5.6% |
| 12.95 | 4388 | 5.5% |
| 12.93 | 4290 | 5.4% |
| 12.91 | 4083 | 5.2% |
| Other values (18) | 22964 |
| Value | Count | Frequency (%) |
| 12.81 | 34 | < 0.1% |
| 12.82 | 270 | 0.3% |
| 12.83 | 1312 | 1.7% |
| 12.84 | 1209 | 1.5% |
| 12.85 | 4923 | |
| 12.86 | 8388 | |
| 12.87 | 4449 | |
| 12.88 | 8279 | |
| 12.89 | 2881 | 3.6% |
| 12.9 | 4640 |
| Value | Count | Frequency (%) |
| 13.08 | 1 | < 0.1% |
| 13.07 | 8 | < 0.1% |
| 13.06 | 7 | < 0.1% |
| 13.05 | 47 | 0.1% |
| 13.04 | 56 | 0.1% |
| 13.03 | 331 | 0.4% |
| 13.02 | 930 | 1.2% |
| 13.01 | 870 | 1.1% |
| 13 | 3753 | |
| 12.99 | 3348 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| df_index | X_19 | X_20 | X_21 | X_22 | X_30 | X_31 | X_32 | X_33 | X_34 | X_35 | X_36 | X_37 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 3.11 | 3.17 | 3.06 | 3.13 | 1.49 | 1.69 | 1.46 | 1.74 | 12.99 | 12.88 | 12.89 | 12.99 |
| 1 | 1 | 2.97 | 3.11 | 2.91 | 3.20 | 1.49 | 1.67 | 1.45 | 1.66 | 12.92 | 12.87 | 12.89 | 12.93 |
| 2 | 2 | 3.04 | 3.04 | 3.01 | 3.12 | 1.49 | 1.69 | 1.46 | 1.68 | 12.97 | 12.87 | 12.87 | 13.00 |
| 3 | 3 | 3.05 | 3.01 | 3.02 | 3.08 | 1.47 | 1.68 | 1.47 | 1.68 | 12.91 | 12.97 | 12.99 | 12.92 |
| 4 | 4 | 3.04 | 3.07 | 3.00 | 3.12 | 1.49 | 1.68 | 1.47 | 1.82 | 12.96 | 12.85 | 12.91 | 12.96 |
| 5 | 5 | 3.22 | 3.20 | 3.16 | 3.22 | 1.50 | 1.65 | 1.48 | 1.67 | 12.96 | 12.91 | 13.01 | 12.99 |
| 6 | 6 | 3.24 | 3.11 | 3.20 | 3.20 | 1.46 | 1.77 | 1.47 | 1.94 | 12.95 | 12.89 | 12.94 | 12.86 |
| 7 | 7 | 3.25 | 3.08 | 3.20 | 3.18 | 1.47 | 1.72 | 1.48 | 2.00 | 13.01 | 12.86 | 12.87 | 12.88 |
| 8 | 8 | 3.12 | 3.18 | 3.13 | 3.11 | 1.50 | 1.64 | 1.46 | 1.64 | 12.97 | 13.00 | 12.86 | 12.88 |
| 9 | 9 | 3.00 | 3.09 | 3.03 | 3.08 | 1.51 | 1.68 | 1.48 | 1.99 | 12.98 | 12.85 | 12.94 | 12.97 |
Last rows
| df_index | X_19 | X_20 | X_21 | X_22 | X_30 | X_31 | X_32 | X_33 | X_34 | X_35 | X_36 | X_37 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 79205 | 39598 | 3.16 | 3.11 | 2.98 | 3.02 | 1.36 | 1.61 | 1.36 | 1.81 | 12.92 | 12.95 | 12.97 | 13.00 |
| 79206 | 39599 | 3.11 | 3.03 | 3.07 | 3.06 | 1.40 | 1.54 | 1.38 | 1.60 | 12.95 | 13.00 | 12.86 | 12.96 |
| 79207 | 39600 | 3.24 | 3.00 | 3.12 | 3.06 | 1.36 | 1.59 | 1.36 | 1.60 | 12.96 | 12.86 | 12.91 | 12.95 |
| 79208 | 39601 | 3.15 | 3.05 | 3.07 | 3.10 | 1.38 | 1.67 | 1.36 | 1.64 | 12.98 | 12.90 | 12.94 | 12.88 |
| 79209 | 39602 | 3.19 | 3.01 | 3.07 | 3.04 | 1.34 | 1.58 | 1.36 | 1.66 | 12.94 | 12.86 | 13.00 | 12.95 |
| 79210 | 39603 | 3.16 | 3.06 | 3.07 | 3.09 | 1.37 | 1.66 | 1.36 | 1.56 | 12.98 | 13.00 | 12.91 | 12.90 |
| 79211 | 39604 | 3.18 | 2.98 | 3.09 | 3.06 | 1.36 | 1.64 | 1.36 | 1.68 | 12.92 | 12.95 | 12.99 | 13.00 |
| 79212 | 39605 | 3.18 | 3.02 | 3.09 | 3.07 | 1.40 | 1.62 | 1.35 | 1.72 | 12.99 | 12.88 | 13.01 | 12.85 |
| 79213 | 39606 | 3.14 | 3.06 | 3.02 | 3.11 | 1.38 | 1.56 | 1.37 | 1.59 | 12.97 | 13.00 | 12.99 | 12.90 |
| 79214 | 39607 | 3.15 | 3.08 | 3.07 | 3.15 | 1.38 | 1.54 | 1.36 | 1.69 | 12.97 | 12.99 | 12.99 | 12.86 |